Inference with Reference: Lossless Acceleration of Large Language Models
We propose LLMA, an LLM accelerator to losslessly speed up Large Language
Model (LLM) inference with references. LLMA is motivated by the observation that abundant identical text spans exist between an LLM's decoding output and a reference that is available in many real-world scenarios (e.g., retrieved documents). LLMA first selects a text span from the reference, copies its tokens to the decoder, and then efficiently checks the tokens' appropriateness as the decoding result, in parallel, within a single decoding step.
The improved computational parallelism allows LLMA to achieve over a 2x speed-up while producing generation results identical to greedy decoding, in many practical generation scenarios where significant overlap exists between the in-context reference and the outputs (e.g., search engines and multi-turn conversations).
Comment: 9 pages
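The copy-then-verify step admits a compact illustration. The following Python sketch is one plausible reading of the abstract, not the paper's implementation: the "model" object with greedy_next (one-token greedy step) and greedy_parallel (one forward pass returning the greedy prediction at every drafted position) is a hypothetical interface. Acceptance stops at the first position where a drafted token disagrees with the model's own greedy choice, which is what keeps the output identical to plain greedy decoding.

def find_reference_continuation(output_tokens, reference, n=4):
    # Find a span in the reference whose first n tokens match the last n
    # generated tokens, and return the tokens after it as a draft to copy.
    if len(output_tokens) < n:
        return None
    suffix = output_tokens[-n:]
    for i in range(len(reference) - n + 1):
        if reference[i:i + n] == suffix:
            return reference[i + n:]
    return None

def llma_decode_step(model, output_tokens, reference, copy_len=8):
    candidate = find_reference_continuation(output_tokens, reference)
    if not candidate:
        # No matching span: fall back to ordinary one-token greedy decoding.
        return output_tokens + [model.greedy_next(output_tokens)]
    draft = candidate[:copy_len]
    # One forward pass scores all drafted positions in parallel; position i
    # yields the greedy token the model would emit after output + draft[:i].
    predictions = model.greedy_parallel(output_tokens, draft)
    accepted = []
    for drafted, predicted in zip(draft, predictions):
        accepted.append(predicted)   # the model's own token is always valid
        if predicted != drafted:     # first disagreement ends the copy
            break
    return output_tokens + accepted

In the best case all copy_len drafted tokens are accepted in a single step; in the worst case the step still yields one valid greedy token, so decoding never regresses below the baseline.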
BeamSearchQA: Large Language Models are Strong Zero-Shot QA Solver
Open-domain question answering is a crucial task that often requires
accessing external information. Existing methods typically adopt a single-turn
retrieve-then-read approach, where relevant documents are first retrieved, and
questions are then answered based on the retrieved information. However, there
are cases where answering a question requires implicit knowledge that is not
directly retrievable from the question itself. In this work, we propose a novel
question-answering pipeline called BeamSearchQA. Our approach leverages large
language models to iteratively generate new questions about the original one, enabling a multi-step reasoning process. By iteratively refining and expanding the scope of the question, our method aims to capture and utilize hidden knowledge that may not be directly obtainable through retrieval. We evaluate our approach on the widely used open-domain NQ and WebQ datasets. The experimental results demonstrate that BeamSearchQA significantly outperforms other zero-shot baselines, indicating its effectiveness in tackling the challenges of open-domain question answering.
Comment: Work in progress
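Since the abstract only outlines the pipeline, the sketch below is a minimal reading of the iterative question-expansion loop under stated assumptions, not the paper's implementation. The callables retrieve, ask_llm, score, and answer_llm are caller-supplied stand-ins for the retriever and the LLM prompts; their names and signatures are assumptions.

def beam_search_qa(question, retrieve, ask_llm, score, answer_llm,
                   beam_width=3, max_rounds=2):
    # retrieve(q) -> list of documents
    # ask_llm(q, docs, n) -> list of up to n follow-up questions
    # score((q, evidence)) -> float ranking a candidate expansion
    # answer_llm(q, evidence) -> answer string
    beam = [(question, [])]  # (current question, evidence gathered so far)
    for _ in range(max_rounds):
        candidates = []
        for q, evidence in beam:
            docs = retrieve(q)
            # Follow-up questions surface implicit knowledge that the
            # original question alone cannot retrieve.
            for follow_up in ask_llm(q, docs, n=beam_width):
                candidates.append((follow_up, evidence + docs))
        if not candidates:
            break
        # Keep the top-scoring expansions, beam-search style.
        beam = sorted(candidates, key=score, reverse=True)[:beam_width]
    _, best_evidence = beam[0]
    # Answer the original question from the evidence along the best beam.
    return answer_llm(question, best_evidence)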
LEAD: Liberal Feature-based Distillation for Dense Retrieval
Knowledge distillation is often used to transfer knowledge from a strong
teacher model to a relatively weak student model. Traditional knowledge
distillation methods include response-based methods and feature-based methods.
Response-based methods are the most widely used but suffer from a lower upper bound on model performance, while feature-based methods impose constraints on vocabularies and tokenizers. In this paper, we propose a tokenizer-free method, liberal feature-based distillation (LEAD). LEAD aligns the distributions of the teacher model and the student model; it is effective, extensible, and portable, and places no requirements on vocabulary, tokenizer, or model architecture.
Extensive experiments show the effectiveness of LEAD on several widely used benchmarks, including MS MARCO Passage, TREC Passage 19, TREC Passage 20, MS MARCO Document, TREC Document 19, and TREC Document 20.
Comment: Work in progress
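The abstract does not specify how the teacher and student distributions are aligned, so the PyTorch snippet below is only a minimal sketch of one tokenizer-free possibility: a KL divergence between softmax distributions over pooled hidden features, with a learned linear projection bridging the dimension mismatch. The pooling, the projection, and the choice of KL are assumptions for illustration, not the paper's exact recipe.

import torch
import torch.nn as nn
import torch.nn.functional as F

def lead_style_loss(teacher_feats, student_feats, proj):
    # teacher_feats: [batch, d_teacher] pooled teacher representations
    # student_feats: [batch, d_student] pooled student representations
    # proj: nn.Linear(d_student, d_teacher) bridging the dimension gap
    t_dist = F.softmax(teacher_feats.detach(), dim=-1)      # fixed target
    s_log_dist = F.log_softmax(proj(student_feats), dim=-1)
    return F.kl_div(s_log_dist, t_dist, reduction="batchmean")

# Example: align a 768-d student layer to a 1024-d teacher layer.
proj = nn.Linear(768, 1024)
loss = lead_style_loss(torch.randn(4, 1024), torch.randn(4, 768), proj)
loss.backward()

Because the alignment target is a feature distribution rather than vocabulary logits, the teacher and student need not share a tokenizer or vocabulary, which is consistent with the tokenizer-free property the abstract claims.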